NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Havoc Paradox in Generator-Based Fuzzing (Registered Report)

https://doi.org/10.1145/3678722.3685529

Li, Ao; Huang, Madonna; Lemieux, Caroline; Padhye, Rohan (September 2024, ACM)

Full Text Available
The Havoc Paradox in Generator-Based Fuzzing

https://doi.org/10.1145/3742894

Li, Ao; Huang, Madonna; Vikram, Vasudev; Lemieux, Caroline; Padhye, Rohan (June 2025, ACM Transactions on Software Engineering and Methodology)

Parametric generators combine coverage-guided and generator-based fuzzing for testing programs requiring structured inputs. They function as decoders that transform arbitrary byte sequences into structured inputs, allowing mutations on byte sequences to map directly to mutations on structured inputs, without requiring specialized mutators. However, this technique is prone to thehavoc effect, where small mutations on the byte sequence cause large, destructive mutations to the structured input. This paper investigates the paradoxical nature of the havoc effect for generator-based fuzzing in Java. In particular, we measure mutation characteristics and confirm the existence of the havoc effect, as well as scenarios where it may be more detrimental. Our evaluation across 7 real-world Java applications compares various techniques that perform context-aware, finer-grained mutations on parametric byte sequences, such as JQF-EI, BeDivFuzz, and Zeugma. We find that these techniques exhibit better control over input mutations and consistently reduce the havoc effect compared to our coverage-guided fuzzer baseline Zest. While we find that context-aware mutation approaches can achieve significantly higher code coverage, we see that destructive mutations still play a valuable role in discovering inputs that increase code coverage. Specialized mutation strategies, while effective, impose substantial computational overhead—revealing practical trade-offs in mitigating the havoc effect.
more » « less
Free, publicly-accessible full text available June 6, 2026
Gauss: program synthesis by reasoning over graphs

https://doi.org/10.1145/3485511

Bavishi, Rohan; Lemieux, Caroline; Sen, Koushik; Stoica, Ion (October 2021, Proceedings of the ACM on Programming Languages)

While input-output examples are a natural form of specification for program synthesis engines, they can be imprecise for domains such as table transformations. In this paper, we investigate how extracting readily-available information about the user intent behind these input-output examples helps speed up synthesis and reduce overfitting. We present Gauss, a synthesis algorithm for table transformations that accepts partial input-output examples, along with user intent graphs. Gauss includes a novel conflict-resolution reasoning algorithm over graphs that enables it to learn from mistakes made during the search and use that knowledge to explore the space of programs even faster. It also ensures the final program is consistent with the user intent specification, reducing overfitting. We implement Gauss for the domain of table transformations (supporting Pandas and R), and compare it to three state-of-the-art synthesizers accepting only input-output examples. We find that it is able to reduce the search space by 56×, 73× and 664× on average, resulting in 7×, 26× and 7× speedups in synthesis times on average, respectively.
more » « less
AutoPandas: neural-backed generators for program synthesis

https://doi.org/10.1145/3360594

Bavishi, Rohan; Lemieux, Caroline; Fox, Roy; Sen, Koushik; Stoica, Ion (October 2019, Proceedings of the ACM on Programming Languages)

Full Text Available
FuzzFactory: domain-specific fuzzing with waypoints

https://doi.org/10.1145/3360600

Padhye, Rohan; Lemieux, Caroline; Sen, Koushik; Simon, Laurent; Vijayakumar, Hayawardh (October 2019, Proceedings of the ACM on Programming Languages)

Full Text Available
JQF: coverage-guided property-based testing in Java

https://doi.org/10.1145/3293882.3339002

Padhye, Rohan; Lemieux, Caroline; Sen, Koushik (January 2019, Proceedings of the 28th {ACM} {SIGSOFT} International Symposium on Software Testing and Analysis, {ISSTA} 2019)

Full Text Available
Semantic fuzzing with zest

https://doi.org/10.1145/3293882.3330576

Padhye, Rohan; Lemieux, Caroline; Sen, Koushik; Papadakis, Mike; Le Traon, Yves (January 2019, Proceedings of the 28th {ACM} {SIGSOFT} International Symposium on Software Testing and Analysis, {ISSTA} 2019)

Full Text Available

Search for: All records